Model Selection

Multi-Scenario Adaptation

# Multi-Scenario Adaptation

TRELLIS Image Large

The image-conditioned version of TRELLIS is a large-scale 3D generation model capable of generating 3D content from images.

3D Vision English

TRELLIS Image Large Fork

TRELLIS is a large-scale 3D generation model that achieves scalable and versatile 3D content creation through structured 3D latent variables.

3D Vision English

Bge Large Zh V1.5 GGUF

BAAI/bge-large-zh-v1.5 is a Chinese sentence transformer model primarily used for feature extraction and sentence similarity calculation.

Text Embedding Chinese

Ade20k Semantic Eomt Large 512

This model is developed based on the paper 'Your ViT is Actually an Image Segmentation Model' and is a Vision Transformer model for image segmentation tasks.

Image Segmentation

Light R1 14B DS GGUF

Light-R1-14B-DS is a 14B-parameter quantized large language model supporting text generation tasks, designed for efficient inference in resource-constrained environments.

Large Language Model

Huihui Ai.granite Vision 3.2 2b Abliterated GGUF

Granite Vision 3.2 2B Abliterated is a vision-language model focused on image-to-text conversion tasks.

Skyreels V1 Hunyuan I2V HFIE

SkyReels-V1-Hunyuan-I2V is a text-to-video generation model developed by Tencent SkyworkAI, based on the Hunyuan architecture, supporting video content generation from text input.

Text-to-Video English

70B L3.3 Mhnnn X1

A large language model fine-tuned based on Llama-3-70B-Instruct, specializing in creative text generation and multi-task processing

Large Language Model

Japanese Parler Tts Large Bate

A Japanese text-to-speech model fine-tuned based on parler-tts-large-v1, capable of generating high-quality Japanese speech

Speech Synthesis

Transformers Japanese

This is an end-to-end speaker segmentation model for voice activity detection, overlap speech detection, and resegmentation tasks.

Audio Processing

Smollm2 Prompt Enhance GGUF

SmolLM2-Prompt-Enhance is a text generation model fine-tuned based on SmolLM2-135M-Instruct, focusing on prompt enhancement tasks.

Large Language Model

Transformers English

Elizabeth Olsen Sdxl Flux

A LoRA model customized based on FLUX.1-dev foundation model, specializing in generating photorealistic images of Elizabeth Olsen (particularly as Marvel's Scarlet Witch)

Stable Diffusion V1.5

A latent diffusion model for text-to-image generation, supporting 512x512 resolution image generation

Image Generation

stablediffusiontutorials

Moondream Caption

A customized small vision model based on Moondream2, fine-tuned specifically for image caption generation tasks

YOLOv10 is a real-time end-to-end object detection model proposed by Tsinghua University, known for its efficiency and accuracy.

Object Detection

YOLOv10 is a real-time end-to-end object detection model that offers a balance between efficient detection performance and accuracy.

Object Detection

BuRP is a versatile roleplay model capable of highly interactive engagement with users, never rejecting any proactive requests while strictly adhering to specific dialogue formats.

Large Language Model

Transformers English

ChaoticNeutrals

Promcse Bert Base Zh

PromCSE is a supervised learning-based sentence embedding model specifically designed for calculating Chinese sentence similarity.

Transformers Chinese

Speecht5 Finetuned Zh TW

A speech processing model based on the SpeechT5 architecture, fine-tuned for Taiwanese Mandarin

Speech Synthesis

Car Brands Classification

A pre-trained image classification model based on the BEiT architecture, supporting Vietnamese labels, suitable for vision tasks

Image Classification

Transformers Other

A model for generating prompts for Stable Diffusion models

Image Generation English

SBERT JSNLI Base

This is a model based on sentence-transformers, capable of mapping sentences and paragraphs into a 768-dimensional dense vector space for tasks such as sentence similarity calculation, clustering, and semantic search.

Sdg Sentence Transformer

This is a model based on sentence-transformers that maps sentences and paragraphs into a 768-dimensional dense vector space, suitable for tasks such as sentence similarity calculation and semantic search.

This is a model based on sentence-transformers, capable of mapping sentences and paragraphs into a 768-dimensional dense vector space, suitable for tasks such as clustering and semantic search.

Bpr Gpl Webis Touche2020 Base Msmarco Distilbert Tas B

This is a model based on sentence-transformers that can map sentences and paragraphs into a 768-dimensional dense vector space, suitable for tasks such as clustering or semantic search.

Cifar 10 Vgg Pretrained

Image classification model implemented with PyTorch, capable of recognizing multiple common object categories

Image Classification

Voice Activity Detection

Voice activity detection model based on pyannote.audio 2.1, used to identify speech activity segments in audio

Speech Recognition

Koelectra Base V3 Generalized Sentiment Analysis

Korean sentiment analysis model based on KoELECTRA-v3, used to determine the positive or negative sentiment tendency of text.

Text Classification

Transformers Korean

Bertweet Base Emotion Analysis

English sentiment analysis model trained on the EmoEvent corpus, utilizing the BERTweet architecture

Text Classification

Transformers English

Paraphrase MiniLM L6 V2

This is a sentence transformer model that maps sentences and paragraphs into a 384-dimensional dense vector space, suitable for tasks such as clustering or semantic search.

Featured Recommended AI Models

AIbase

Empowering the Future, Your AI Solution Knowledge Base

English 简体中文繁體中文にほんご

© 2025AIbase